Day 30 - Model Complexity Analysis

2021 iThome Ironman Contest

DAY 30

AI & Data

Machine Learning Applied to Speech-Related Services series, Part 30

13th Ironman Contest

pwhsiao

2021-10-12 20:49:39

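In this final installment, we evaluate the complexity of the static and dynamic models using four measures: the number of parameters, the number of multiplications, the training time per epoch, and the test time per data sample. The number of multiplications for the MLP, CNN, and LSTM-RNN is computed as follows: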

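MLP: $M\times N$
where $M$ and $N$ are the input and output dimensions, respectively.
CNN: $o_{H}\times o_{W}\times o_{C}\times f_{W}\times f_{H}\times i_{C}$
where $o_{H}$, $o_{W}$, and $o_{C}$ are the height, width, and number of channels of the output feature map; $f_{H}$ and $f_{W}$ are the height and width of the convolution kernel; and $i_{C}$ is the number of input channels.
LSTM-RNN: $[4\times (K+Z)]\times Z+3\times Z^{2}$
where $K$ and $Z$ are the input and output dimensions, respectively.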
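The three formulas above can be written as small helper functions. This is a minimal sketch, not the code used to produce the results in this post: the function names and the example sizes are hypothetical, and the MLP and CNN functions are assumed to count a single layer each.

```python
def mlp_mults(m: int, n: int) -> int:
    """Multiplications of one fully connected layer: M x N."""
    return m * n


def cnn_mults(o_h: int, o_w: int, o_c: int, f_h: int, f_w: int, i_c: int) -> int:
    """Multiplications of one convolution layer:
    o_H x o_W x o_C x f_W x f_H x i_C."""
    return o_h * o_w * o_c * f_w * f_h * i_c


def lstm_mults(k: int, z: int) -> int:
    """Multiplications of one LSTM layer as given in the text:
    [4 x (K + Z)] x Z + 3 x Z^2."""
    return 4 * (k + z) * z + 3 * z * z


if __name__ == "__main__":
    # Illustrative sizes only; they are not taken from the models in this post.
    print(mlp_mults(100, 10))             # 100 * 10
    print(cnn_mults(8, 8, 16, 3, 3, 1))   # 8*8*16 outputs, 3x3 kernel, 1 input channel
    print(lstm_mults(10, 20))             # 4*(10+20)*20 + 3*20^2
```

For a network with several layers, the per-layer counts would be summed.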

The complexity analysis of the three models is as follows:

# hidden layers | # parameters | # mul. operations | training time per epoch | test time per sample
------------- | ------------- | ------------- | ------------- | -------------
1 | 11,705 | 11,670 | 1s | 0.007s
2 | 12,635 | 12,570 | 1s | 0.008s
3 | 13,565 | 13,470 | 1s | 0.01s

Table 1: Complexity analysis of the static MLP model
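As a sanity check on the $M\times N$ formula, the counts in Table 1 can be reproduced by summing over layers. The layer sizes below (384 inputs, 30 units per hidden layer, 5 outputs) are an assumption: they are not stated in this post and were chosen only because they match the table. The helper name `mlp_counts` is likewise hypothetical.

```python
def mlp_counts(layer_sizes):
    """Return (parameters, multiplications) for a stack of fully connected layers.

    Multiplications: sum of M x N over consecutive layer pairs.
    Parameters: the same weights plus one bias per non-input unit.
    """
    mults = sum(m * n for m, n in zip(layer_sizes, layer_sizes[1:]))
    biases = sum(layer_sizes[1:])
    return mults + biases, mults


if __name__ == "__main__":
    # Assumed sizes (not given in the post): 384 -> 30 (x hidden layers) -> 5
    for hidden in (1, 2, 3):
        sizes = [384] + [30] * hidden + [5]
        params, mults = mlp_counts(sizes)
        print(hidden, params, mults)
    # prints:
    # 1 11705 11670
    # 2 12635 12570
    # 3 13565 13470
```

Under these assumed sizes, each extra hidden layer adds 900 weights and 30 biases, which matches the step between rows of the table.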

model | # parameters | # mul. operations | training time per epoch | test time per sample
------------- | ------------- | ------------- | ------------- | -------------
Basic CNN | 24,675 | 296K | 1s | 0.014s
Multi-scale CNN | 139,805 | 6.5M | 3s | 0.035s
Multi-scale CNN with attention | 149,405 | 6.7M | 3s | 0.045s

Table 2: Complexity analysis of the static CNN models

model | # parameters | # mul. operations | training time per epoch | test time per sample
------------- | ------------- | ------------- | ------------- | -------------
LSTM-RNN (last-frame only) | 27,389 | 37,628 | 423s | 0.8s
LSTM-RNN (mean-pooling over time) | 27,389 | 37,628 | 434s | 0.8s
LSTM-RNN with attention | 25,675 | 36,230 | 215s | 0.4s

Table 3: Complexity analysis of the dynamic LSTM-RNN models

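That wraps up this 30-day journey through speech recognition and speech emotion recognition. Thank you all for reading and for your feedback. Taking a bow!!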